Study of Phonemes Confusions in Hierarchical Automatic Phoneme Recognition System

نویسندگان

Rimah Amami

Noureddine Ellouze

چکیده

In this paper, we have analyzed the impact of confusions on the robustness of phoneme recognitions system. The confusions are detected at the pronunciation and the confusions matrices of the phoneme recognizer. The confusions show that some similarities between phonemes at the pronunciation affect significantly the recognition rates. This paper proposes to understand those confusions in order to improve the performance of the phoneme recognition system by isolating the problematic phonemes. Confusion analysis leads to build a new hierarchical recognizer using new phoneme distribution and the information from the confusion matrices. This new hierarchical phoneme recognition system shows significant improvements of the recognition rates on TIMIT database.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Phoneme Classification Using New Feature Extraction Techniques based on Mellin Transform

This paper presents a new hierarchical phoneme recognition system using the SVM classifier and different feature representations based on mellin transform. The proposed architecture uses different representations with each group of phonemes of the speech database TIMIT which are distributed in a way to reduce the confusions between phonemes having similar articulatory strcuture. The main idea o...

متن کامل

Designing and implementing a system for Automatic recognition of Persian letters by Lip-reading using image processing methods

For many years, speech has been the most natural and efficient means of information exchange for human beings. With the advancement of technology and the prevalence of computer usage, the design and production of speech recognition systems have been considered by researchers. Among this, lip-reading techniques encountered with many challenges for speech recognition, that one of the challenges b...

متن کامل

Learning from human errors: prediction of phoneme confusions based on modified ASR training

In an attempt to improve models of human perception, the recognition of phonemes in nonsense utterances was predicted with automatic speech recognition (ASR) in order to analyze its applicability for modeling human speech recognition (HSR) in noise. In the first experiments, several feature types are used as input for an ASR system; the resulting phoneme scores are compared to listening experim...

متن کامل

Allophone-based acoustic modeling for Persian phoneme recognition

Phoneme recognition is one of the fundamental phases of automatic speech recognition. Coarticulation which refers to the integration of sounds, is one of the important obstacles in phoneme recognition. In other words, each phone is influenced and changed by the characteristics of its neighbor phones, and coarticulation is responsible for most of these changes. The idea of modeling the effects o...

متن کامل

Improving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM

Improving phoneme recognition has attracted the attention of many researchers due to its applications in various fields of speech processing. Recent research achievements show that using deep neural network (DNN) in speech recognition systems significantly improves the performance of these systems. There are two phases in DNN-based phoneme recognition systems including training and testing. Mos...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

CoRR

دوره abs/1508.01718 شماره

صفحات -

تاریخ انتشار 2015

Study of Phonemes Confusions in Hierarchical Automatic Phoneme Recognition System

نویسندگان

چکیده

منابع مشابه

Phoneme Classification Using New Feature Extraction Techniques based on Mellin Transform

Designing and implementing a system for Automatic recognition of Persian letters by Lip-reading using image processing methods

Learning from human errors: prediction of phoneme confusions based on modified ASR training

Allophone-based acoustic modeling for Persian phoneme recognition

Improving Phoneme Sequence Recognition using Phoneme Duration Information in DNN-HSMM

عنوان ژورنال:

اشتراک گذاری